Situation Testing-Based Discrimination Discovery: A Causal Inference Approach
نویسندگان
چکیده
Discrimination discovery is to unveil discrimination against a specific individual by analyzing the historical dataset. In this paper, we develop a general technique to capture discrimination based on the legally grounded situation testing methodology. For any individual, we find pairs of tuples from the dataset with similar characteristics apart from belonging or not to the protected-by-law group and assign them in two groups. The individual is considered as discriminated if significant di↵erence is observed between the decisions from the two groups. To find similar tuples, we make use of the Causal Bayesian Networks and the associated causal inference as a guideline. The causal structure of the dataset and the causal e↵ect of each attribute on the decision are used to facilitate the similarity measurement. Through empirical assessments on a real dataset, our approach shows good e cacy both in accuracy and e ciency.
منابع مشابه
Causal Discrimination Discovery Through Propensity Score Analysis
Social discrimination is considered illegal and unethical in the modern world. Such discrimination is often implicit in observed decisions’ datasets, and anti-discrimination organizations seek to discover cases of discrimination and to understand the reasons behind them. Previous work in this direction adopted simple observational data analysis; however, this can produce biased results due to t...
متن کاملJoint Probabilistic Inference of Causal Structure
Causal directed acyclic graphical models (DAGs) are powerful reasoning tools in the study and estimation of cause and effect in scientific and socio-behavioral phenomena. In many domains where the cause and effect structure is unknown, a key challenge in studying causality with DAGs is learning the structure of causal graphs directly from observational data. Traditional approaches to causal str...
متن کاملBayesian Probabilities for Constraint-Based Causal Discovery
We target the problem of accuracy and robustness in causal inference from finite data sets. Our aim is to combine the inherent robustness of the Bayesian approach with the theoretical strength and clarity of constraint-based methods. We use a Bayesian score to obtain probability estimates on the input statements used in a constraint-based procedure. These are subsequently processed in decreasin...
متن کاملA Logical Characterization of Constraint-Based Causal Discovery
We present a novel approach to constraintbased causal discovery, that takes the form of straightforward logical inference, applied to a list of simple, logical statements about causal relations that are derived directly from observed (in)dependencies. It is both sound and complete, in the sense that all invariant features of the corresponding partial ancestral graph (PAG) are identified, even i...
متن کاملModel-Based Approach to FDR Estimation
False Discovery Rate (FDR) has become commonly used in multiple comparisons, in part driven by studies involving large numbers of parameters, e.g. microarray data analysis. FDR is a philosophically different approach from traditional hypothesis testing, and it offers a conceptually consistent framework for multiple hypothesis testing when the primary objective focuses on discovery instead of te...
متن کامل